56 research outputs found

    A method for extracting travel patterns using data polishing

    Get PDF
    With recent developments in ICT, the interest in using large amounts of accumulated data for traffic policy planning has increased significantly. In recent years, data polishing has been proposed as a new method of big data analysis. Data polishing is a graphical clustering method, which can be used to extract patterns that are similar or related to each other by identifying the cluster structures present in the data. The purpose of this study is to identify the travel patterns of railway passengers by applying data polishing to smart card data collected in the Kagawa Prefecture, Japan. To this end, we consider 9,008,709 data points collected over a period of 15 months, ranging from December 1st, 2013 to February 28th, 2015. This dataset includes various types of information, including trip histories and types of passengers. This study implements data polishing to cluster 4,667,520 combinations of information regarding individual rides in terms of the day of the week, the time of the day, passenger types, and origin and destination stations. Via the analysis, 127 characteristic travel patterns are identified in aggregate

    Asymptotic properties of parametric and nonparametric probability density estimators of sample maximum

    Full text link
    Asymptotic properties of three estimators of probability density function of sample maximum f(m):=mfFm1f_{(m)}:=mfF^{m-1} are derived, where mm is a function of sample size nn. One of the estimators is the parametrically fitted by the approximating generalized extreme value density function. However, the parametric fitting is misspecified in finite mm cases. The misspecification comes from mainly the following two: the difference mm and the selected block size kk, and the poor approximation f(m)f_{(m)} to the generalized extreme value density which depends on the magnitude of mm and the extreme index γ\gamma. The convergence rate of the approximation gets slower as γ\gamma tends to zero. As alternatives two nonparametric density estimators are proposed which are free from the misspecification. The first is a plug-in type of kernel density estimator and the second is a block-maxima-based kernel density estimator. Theoretical study clarifies the asymptotic convergence rate of the plug-in type estimator is faster than the block-maxima-based estimator when γ>1\gamma> -1. A numerical comparative study on the bandwidth selection shows the performances of a plug-in approach and cross-validation approach depend on γ\gamma and are totally comparable. Numerical study demonstrates that the plug-in nonparametric estimator with the estimated bandwidth by either approach overtakes the parametrically fitting estimator especially for distributions with γ\gamma close to zero as mm gets large

    A semiparametric probability distribution estimator of sample maximums

    Full text link
    Several approaches of nonparametric inference for extreme values have been studied. This study surveys the semiparametric probability distribution estimation of sample maximums. Moriyama (2021) clarified that the parametric fitting to the generalized extreme value distribution becomes large as the tail becomes light, which means the convergence becomes slow. Moriyama (2021) proposed a nonparametric distribution estimator without the fitting of the distribution and obtained asymptotic properties. The nonparametric estimator was proved to outperform the parametrically fitting estimator for light-tailed data. Moreover, it was demonstrated that the parametric fitting estimator numerically outperformed the nonparametric one in other cases. Motivated by the study, we construct two types of semiparametric distribution estimators of sample maximums. The proposed distribution estimators are constructed by mixing the two distribution estimators presented in Moriyama (2021). The cross-validation method and the maximum-likelihood method are presented as a way of estimating the optimal mixing ratio. Simulation experiments clarify the numerical properties of the two types of semiparametric distribution estimators

    ASYMPTOTIC PROPERTIES OF A KERNEL TYPE ESTIMATOR OF A DENSITY RATIO

    No full text

    Follow-Up of Patients Who Achieved Sustained Virologic Response after Interferon-Free Treatment against Hepatitis C Virus: Focus on Older Patients

    No full text
    Background and Objectives: Direct-acting antiviral agents (DAAs) have improved sustained virologic response (SVR) rates in patients with chronic hepatitis C virus (HCV) infection. Our aim was to elucidate the occurrence of hepatocellular carcinoma (HCC) and to compare the outcomes of patients aged 75 years or older (older group) with those of patients younger than 75 years (younger group) after SVR. Materials and Methods: Among 441 patients treated with interferon-free DAA combinations, a total of 409 SVR patients were analyzed. We compared the two age groups in terms of HCC incidence and mortality rates. Results: Older and younger groups consisted of 68 and 341 patients, respectively. Occurrence of HCC after SVR did not differ between the two groups of patients with a history of HCC. Occurrence of HCC after SVR was observed more in younger patients without a history of HCC (p < 0.01). Although older patients without a history of HCC had a higher mortality rate (p < 0.01), their causes of death were not associated with liver diseases. Among younger patients without a history of HCC, none died. Conclusions: After SVR, liver disease may not be a prognostic factor in older HCV patients without a history of HCC
    corecore